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1 INTRODUCTION 

Let Xi, X2, ■ ■ ■ , Xn, ■ ■ ■ be a discrete-time stochastic process, whose distribution 
depends on an unknown "parameter" 9. We consider the classical problem of testing 
multiple hypotheses Hi: 6 = 61, H2 : 6 = 62, ■ ■ ■ , Hk : 9 = 9k, k>2. 

The main goal of this article is to characterize the structure of optimal sequential 
tests in this problem. 

Let us suppose that for any n = 1,2, ... , the vector {Xi,X2, . . . , Xn) has a 
probability "density" function 

fg{xi,X2,...,Xn) (1) 

(Radon-Nicodym derivative of its distribution) with respect to a product-measure 

— ^ (K) ^ (Xi • • • (Xi ^i, 
n times 

for some cr-finite measure fj, on the respective space. 

We define a (randomized) sequential hypothesis test as a pair {^p, </>) of a stopping 
rule tp and a decision rule (f>, with 



4'= ('i/'l,V'2,---,V'n,---)> 



and 

(P= (01,</'2,---,<?i'n,---)- 
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The functions 

'4'n = '4'n{xi,X2,...,Xn), n = 1,2, . . . , 

are supposed to be some measurable functions with values in [0, 1]. The functions 

(l>n = (l>n{xi,X2,...,Xn), n = 1,2, . . . 

are supposed to be measurable vector-functions with k non-negative components 

(t>i = ^Uxi,---,xny- 

such that Yli=i = 1 for any n = 1,2, 

The interpretation of all these elements is as follows. 

The value of i/'„(xi, . . . , Xn) is interpreted as the conditional probability to stop 
and proceed to decision making, given that we came to stage n of the experiment 
and that the observations up to stage n were {xi,X2, ■ ■ ■ ,Xn)- If there is no stop, 
the experiments continues to the next stage and an additional observation x„_|_i is 
taken. Then the rule ijjn+i is applied to xi,X2, ■ ■ ■ ,Xn, Xn+i in the same way as as 
above, etc., until the experiment eventually stops. 

It is supposed that when the experiment stops, a decision to accept some of 
Hi,. . . ,Hk is to be made. The function . . . ,Xn) is interpreted as the con- 

ditional probability to accept Hi, i = 1,. . . ,k, given that the experiment stops at 
stage n being (xi, . . . , a;„) the data vector observed. 

The stopping rule ip generates, by the above process, a random variable T0 {stop- 
ping time) whose distribution is given by 

Pg{T^ = n) - Eg{l - ^i)(l - V^a) ... (1 - iJn-l)i'n- 

Here, and throughout the paper, we interchangeably use ipn both for ijjn{xi,xi, . . . , Xn) 
and for . . . and so do we for any other function of observations 

_F„. This does not cause any problem if we adopt the following agreement: when 
F„ is under probability or expectation sign, it is i^„(Xi, . . . , otherwise it is 

^n^Xi, . . . , Xn)' 

For a sequential test {ip, (j)) let us define 

oo 

ay (t/., 0) = Pe. ( accept if^) = ^ i;^. (1 - ^i) . . . (1 - Vn-i)V'n0^„ (2) 

Tl=l 

and 

/3j('!/', (j)) = ( accept any iJj different from JJj) = ctijii^, (p), (3) 

i = 1, . . . ,k, j = 1, . . . ,k. The probabilities aij{il), (j)) for j ^i can be considered "in- 
dividual" error probabilities and (j)) "gross" error probability, under hypothesis 
Hi, of the sequential test {tp,(j)). 

Another important characteristic of a sequential test is the average sample num- 
ber. 

N{6;i,) = Eer^ = {^-^ "''^^^'^ = '^''^^^'^ <oo) = l, 
I oo otherwise. 
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Let 9 be any fixed (and known) value of the parameter (we do not suppose, 
generally, that 9 is one oi 9i, i = 1, . . . k). 

In this article, we solve the two following problems: 

Problem I. Minimize N{i/j) = N{9; ip) over all sequential tests (f) subject to 

aij{Tp, (j)) < aij, for alH = 1, . . . fc, and for all j ^ i, (5) 

where G (0, 1) (with i, j ^ I, . . . k, j ^ i) are some constants. 

Problem II. Minimize N{ip) over all sequential tests (f) subject to 

A(V',0)<A, foralH = l,...fc, (6) 

with some constants j3i G (0, 1), i — 1, . . . , k. 

More general problems of minimizing an average cost of type 

oo 

7V(V') = EeCnil - ^l) ... (1 - ^n-l)^n 

71 = 1 

with some cost function C„ = Cn{Xi, X2, ■ ■ ■ Xn) can be treated in essentially the 
same manner. 

If fc = 2 then Problems I and II are equivalent, because (3i{ip, cj)) = ai2('0, 0), and 
/32(^/',0) =a2i(^,0), by @ and ®. 

For independent and identically distributed (i.i.d.) observations and k ~ 2 the 
formulated problem, when 9 ^ 9i and 9 92, is known as the modified Kiefer- Weiss 
problem (see |10|). being the original Kiefer- Weiss problem minimizing snpg N^ip) 
under (O (see [6]). 

For the latter problem, taking into account the usual relations between Bayesian 
and minimax procedures, it seems to be reasonable to generalize our problem of 
minimizing N(9; ip) to that of minimizing 

J N{9-i>)d^{9), 

with some "weight" measure tt. From what follows it is easily seen that, under 
natural measurability conditions, our method works as well for this latter problem. 

In Section 2, we reduce Problems I and II to an unconstrained minimization 
problem. The new objective function is the Lagrange- multiplier function £(-0; (j)). 

In Section 3, we find 

L(V)=infL(0,0), 

4> 

where the infimum is taken over all decision rules. 

In Section 4, we minimize L{^) in the class of truncated stopping rules, i.e. such 
that = 1 for some < TV < 00. 

In Section 5, we characterize the structure of optimal stopping rule '0 in the class 
of all stopping rules. 

In Section 6, we apply the results obtained in Sections 2 - 5 to the solution of 
Problems I and II. 
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2 REDUCTION TO NON-CONSTRAINED MIN- 
IMIZATION 

In this section, the Problems I and II wiU be reduced to unconstrained optimization 
problems using the idea of the Lagrange multipliers method. 

2.1 Reduction to Non-Constrained Minimization in Problem I 

To proceed with minimizing N{ijj) over the sequential tests subject to (O, let us 
define the following Lagrange-multiplier function: 

L{ij,(f>) = N{^)+ (7) 

1<Z j'<fc; i^j 

where Ay > are some constant multipliers. Recall that N{ip) = Eqt^, where 9 
is the fixed value of parameter for which the average sample number (|4]) is to be 
minimized. Generally, we do not suppose that 6 is one of 0^, z = 1, . . . , /c. 
Let A be a class of tests. 

The following theorem is a direct application of the Lagrange multipliers method. 

Theorem 1. Let exist Ay > 0, i = l,...,fc, j = \,...^k, j ^ i, and a test 
("0*, (j)*) e A such that for all sequential tests (ip, (/>) G A 

Lir,r)<Li^,^) (8) 

holds and such that 

aij{ip* 1 0*) = Oiij for all i = 1, . . . fc, and for all j ^ i. (9) 
Then for all [tp, 0) g A such that 

aij{il), (j)) < aij for all i = 1, . . . k, and for all j ^ i, (10) 

it holds 

N{ip*) < N{il:). (11) 
The inequality in ill}) is strict if at least one of the equalities IJOj) is strict. 
Proof. Let {ipjtj)) G A be any sequential test satisfying (fTU]) . Because of ^ 

< Li^j, 0) = N{iP) + J2 Ay-ay (^, 0) < iV(0) + ^ Ayay , (12) 

where to get the last inequality we used ([5]). 
So, 
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and taking into account conditions © we get from this that 

To get the last statement of the theorem we note that if N{ip*) = N{iJ;) then there 
are equaUties in p2p instead of inequahties which is only possible if aij{ijj, (j)) — Uij 
for any i, j = 1, . . . fc, j 7^ i. □ 

Remark 1. The author owes the idea of the use of the Lagrange- multiplier method 
in sequential hypotheses testing to Berk [l]. Essentially, the method of Lagrange 
multipliers is implicitly used in the monograph of Lehmann [7] in the proof of the 
fundamental lemma of Neyman-Pearson. In a way, the Bayesian approach in hy- 
potheses testing can be considered as a variant of the Lagrange-multiplier method 
as well. 

Remark 2. All our results below can be adapted to the Bayesian context by choos- 
ing appropriate Lagrange multipliers and using 

fc 

i=l 

instead of N{9] ip) in ^("0, 0) above. From this point of view, we extend and com- 
plement the results of Cochlar [2] about the existence of Bayesian sequential tests. 
More generally, all our results are applicable as well for minimization of 

j N{9-^/j)dTT{0), 

where tt is any probability measure (see Remarks [5] and [11] below) . 

2.2 Reduction to Non-Constrained Minimization in Problem II 

Very much like in the preceding section, define 

k 

L{i^,cf>)=N{^) + Y,KHi^,4>), (13) 

where Ai > are the Lagrange multipliers. 

In a very similar manner to Theorem [1] we have 

Theorem 2. Let exist Ai > 0, i = 1, . . . , fc, and a sequential test {ip* , (ff) G A such 
that for all {ip.cj)) G A 

L{rA*)<L{^.4>) (14) 

holds and such that 

A(V'*, (/)*)= ft for all i = l,...k. (15) 
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Then for all sequential tests {ip, 0) G A such that 

/3,(V', <?!))< A for all i^l,...k, (16) 

it holds 

N{iP*) < iV(?/'). (17) 
The inequality in |j7|j is strict if at least one of the equalities Iil6\) is strict. 



3 OPTIMAL DECISION RULES 

Due to Theorems [T] and Problem I is reduced to minimizing ([7]) and Problem 11 
is reduced to minimizing (fT3)) . But (fT3|) is a particular case of ([7]), namely, when 
Ay = Xi for any j = 1, . . . ,k, j i (see 1^ and ([3])). Because of that, from now on, 
we will only solve the problem of minimizing L(i/;, </>) defined by ([7]). 
In particular, in this section we find 

and the corresponding decision rule (p, at which this infimum is attained. 
Let I A be the indicator function of the event A. 

Theorem 3. For any \ij >Q, i = \, . . . ,k, j ^ i, and for any sequential test (jp, 4>) 

CO „ 

L{i;, </)) > iV(^) + ^ / (1 - ^i) . . . (1 - ^„_i)^„/„d/x", (18) 

where 

l<j<k ^ — ' 

Supposing that N^ip) is finite, the right-hand side of il8\) is attained if and only if 
for all j — 1, ... k, ij,"' -almost anywhere on 

— {(2^1 7 ■ ■ ■ I ^n) ■ {xi , ■ • ■ , Xn) > 0}, 

where s^ixi, . . . , a;„) = = (1 - -(/ii) . . . (1 - V'«-i)^/'n, for all 1,2, 

Proof. Inequality is equivalent to 

oo „ 

^ A,,ay(^,0) > ^ /(l-^l)...(l-V'n-l)V'nWAi". (21) 

l<i,j<k:,j=jti n=l-' 

We prove it by finding a lower bound for the left-hand side of ((2T|) and proving 
that this lower bound is attained if (p satisfies (PU)) . 
To do this, we will use the following simple 
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Lemma 1. Let 01, . . . , 0^ and Fi, . . . Fk be some measurable non-negative functions 
on a measurable space with a measure fj,, such that 

k 

^(/),(a;) EE 1, 

1=1 

and such that 



min Fi(x)dLi(x) < oo. 

l<i<k 



Then 



with an equality in \2^) if and only if 



j \ y24>^{x)F,{x)\d^l{x)> J mm^F,ix)d^iix) (22) 



^^{F.^ minFj} V « = 1, 2, . . . , /c, (23) 



i<j<k 
fi-almost anywhere. 

Proof. To prove ([H]) it suffices to show that 

k 



J \^^(l)^{x)F,{x)j dii{x) ~ j ^n^F,{x)dii{x) > 0, (24) 

because the second integral is finite by the conditions of the Lemma. 
But ([M)) is equivalent to 

/ Y,Ux){F,{x) - min FJ{x))d^l{x) > 0, (25) 

being this trivial because the function under the integral sign is non-negative. 
Because of this, there is an equality in (P5|) if and only if 

k 

y (f>.,{x){F,{x) - min F,(x)) ^ 

i—1 



/i-almost anywhere, which is only possible if ((23)) holds true. □ 
Starting with the proof of (j2ip . let us give to the left-hand side of it the form 

A,jay(V',(/)) 

l<i,j<k;j^i 

= E / (1 - '/'l) ■ ■ ■ (1 - V-n-O^n E E ^ndli'' (26) 
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(see ©). 

Applying Lemma [1] to each summand in (|26p we immediately have: 

A,,a,,(^,0) > ^ /(l-^i)...(l-V„_i)^„?„dM" (27) 

l<i,j<k-j^i n=l-' 

with an equality if and only if 

for all 1 < j < fc, /i"-almost anywhere on S*^, for all n = 1, 2, . . . . □ 
Remark 3. It is easy to see, using ^ and {21^ , that 

L(V) =infi(V,'^) = E / (l-^i)---(l-^n-i)^«(n/e" + ;„)dA'" (28) 

n— 1 

if Pe{T^ < oo) — 1 and L{'ip) — oo otherwise. 

Remark 4. In the Bayesian context of Remark\^ the "if "-part of Theorem\3l can 
also be derived from Theorem 5.2.1 J^. 

4 TRUNCATED STOPPING RULES 

Our next goal is to find a stopping rule ip minimizing the value of L{ip) in (j28p . 

In this section, we solve, as an intermediate step, the problem of minimization of 
L(V') in the class of truncated stopping rules, that is, in the class of 

V' = (V'i,V'2,...>Jv-i,l,...)- (29) 
For any ip 6 let us define 

N 

LnW = ^ / (1 - ^l) . . . (1 - ^„-l)Vn {nf^ + In) diJ^^ (30) 

n=l 

(see ([28l)V 

The following lemma takes over a large part of work of minimizing Ln{'4') over 
^ G A^. 

Lemma 2. Let r > 2 be any natural number, and let Vr = Vr{xi, X2, ■ ■ • , Xr) be any 
measurable function such that J VrdijT < oo. Then 



J2 / (1 - V^l) . . . (1 - ^Pn-l)Mnfg + ln)dfl'' 
n=l 

+ /(I - ^l) ... (1 - ^r-l) {rfS + Vr) d^l' 
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r-2 „ 

^ E / (1 - V-i) • • • (1 - ^n^i)Mnfe + ln)diJ^ (31) 

n=l 

+ /(I - V^i) . . . (1 - ((r - l)fl-^ + 



Vr-l = min{?r_i, /g"^ + / Vr{xi, . .. , Xr)dfi{Xr)}. (32) 



There is an equality in Ii31\) if and only if 

fj,''^^ -almost anywhere on Cf_i, where, by definition, 

Ct = {{xi,...,Xn) ■■ (1 - V'l(xi)). .. (1 - lpn-l{xi,...,Xn-l)) > 0} 

for any n — 1,2, . . . 

Proof. Let us start with the following simple consequence of Lemma [1] 

Lemma 3. Let x, 0, ^i, F2 be some measurable functions on a measurable space with 
a measure fi, such that 

0<x(2;)<l, 0<(t>{x)<l, Fi{x)>0, F2{x)>0, 

and 

Jmin{F,ix),F2ix)}d,ix)<oo. 

Then 

xix){^ix)Fiix) + {l-^ix))F2ix))dfiix)> J xix)mm{Flix),F2ix)}d^lix) (34) 
with an equality if and only if 

I{Fi{x)<F2{x)} < (f'ix) < I{Fi{x)<F2(x)} (35) 

fi-almost anywhere on {x : x{x) > 0}. 

Proof. Defining 4'i{x) = (j^ix) and 4'2{x) = 1 — fpix), from Lemma[T]we immediately 
obtain (|34p , with an equality if and only if 



4>l{x) = 4>{x) < I{x(x){Fi{x)-min{Fi{x),F2{x)})=0} (36) 

and 

(j)2{x) = 1 - (l){x) < I{x(x){F2{x)^min{Fi{x),F2{x)})=0} (37) 

/i-almost anywhere. Expressing (j){x) from (I36|) and (j37[) we have that there is an 



equality in (j34p if and only if 

I{xix)(.F2{x)-min{Fi{x),F2{x)})>0} < H^) < I{x{x)(Fi{x)~min{Fi{x),F2ix)})=0} 
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^-almost anywhere, which is equivalent to 

I{Pii^)<-F'2ix)} < Hx) < I{Fi{x)<F2{x)} M - ahuost anywhere on {x{x) > 0}. 

□ 

To start with the proof of Lemma[l]let us note that for proving (pij) it is sufficient 
to show that 

'(1 - Vl) ... (1 - A-2)i^r-l{{r - l)/r' + lr-l)dfi^~^ 

+ - V'l) . . . (1 - A-i) {rfe + vr) dff 

> y (1 - V-i) ... (1 - A-2) {{r - l)/r' + «r-i) dfi'^-\ (38) 
By Fubini's theorem the left-hand side of ([55]) is equal to 

(1 - Vi) ... (1 - A-2)A~i{{r - l)/r' + lr-i)dpi'-^ 



+ / (1 - Vi) . . . (1 - A-i) [rfe + Vr) dfi{xr)j dfi^-' 

(1 - l^l) . . . (1 - ijr-2)[A-l{{r - l)/r' + Ir-l) 

+ (1 - A-i) J {rfe + Vr) df,{xr)W^-\ (39) 
Because fg{xi, . . . , x,.) is a joint density function of {Xi, . . . , Xr), we have 

fsixi, . . .,Xr)d^i{Xr) = /g"^(xi, . . .,Xr-l), 

so that the right-hand side of transforms to 

(l-Vi)...(l-^.-2)[(r-l)/r' 



+ 1pr-llr-l + (1 - ^r-l)(/;"^ + J Vrdn{Xr))W-\ (40) 

Applying Lemma [3] with 

X = (1 - -01) . . . (1 - ■0r-2), (/' = '0r-l, 
Fl = lr-1, F2 = fg^^ + / Wrrf/Xr, 
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we see that (^0)) is greater or equal than 

(1 - ^i) ... (1 - A-2)[{r - + min{/,.-i, + / Vrdfi{xr)}W^-' 



(1 - T/^i) . . . (1 - iPr-2)[{r - + Vr-iW--\ (41) 

by the definition of Wr-i in ((32|) . 

Moreover, by the same Lemnia[31 (PO]) is equal to (|4ip if and only if (IBS)) is satisfied 
//'"""'^-almost anywhere on Cf_i. □ 

Let now ■)/) S be any truncated stopping rule. 
By (|30|) we have 

JV-l „ 



+ y (1 - V-i) . . . (1 - V-JV-i) (iV/e^^ + /iv) d/^''. (42) 
Let Vff = In- Applying Lemma [2] with r ^ N and vn — we have 

N-2 

LnH^) >Y1 / (1 - '/'i) ■ • • (1 - iM-i)Mnf0 + ln)dti^ 



+ y (1 - V-i) . . . (1 - {{N ~ 1)4^-1 + V^_,) df,''-\ (43) 

where V^_j = min{/7v_i, /j^^^ + ^ d^^x^)}- Also by Lemma 2, the inequahty 
in (I43p is in fact an equality if 

^^-1 =^{U-i</,"-^+/v^dM(x„)}- (44) 
Applying Lemma [2] to the right-hand side of again we see that 

JV-3 » 

iAr(V) > I] / (1 - ^i) ... (1 - i^u-i)Mnfe + Qd^i" 



+ y (1 - Vi) . . . (1 - V^^_3) ((^ - 2)/,^-^ + (45) 

where V/^_2 = min{/Ar_2, f^^'^ + J V/^_idfj,{xN-i)}- There is an equality in (^5]) if 
dM]) holds and 

V'iV-2 = -^{z„_3</,"-^+/y«_,dp(x„_i)}' (46) 

etc. 
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Repeating the applications of Lemma 2, we finally get 

LnW > I {fl + ^^1^) d^' = 1 + / V,^d^l{x,), (47) 



and a series of conditions on ijj, starting from (pS)) . etc., under which L{iji) 

is equal to the right-hand side of (ITT)) . Because Lemma 2 also gives necessary and 
sufficient conditions for attaining the equality, we also have necessary conditions for 
attaining the lower bound in (|T7l) . 

In this way, formally, we have the following 

Theorem 4. Let G he any (truncated) stopping rule. Then for any 1 < r < 
— 1 the following inequalities hold true 



LnW > E / (1 - ^i) • • ■ (1 - i^n-i)Mnfe + ^n)dA^" 

n=l 

+ /(I - ^i) ... (1 - A) {{r + l)/r^ + K+i) d^l^+' (48) 

r— 1 

E /(I - ^l) • • ■ (1 - V'«-l)V'n(n/e" + Z„)dAi" 



r-l 

> 

n=l • 



+ y (1 - ^i) • . . (1 - ^.-i) (^/e + K"^) (49) 

where Vjv = 'iV; recursively for ni = N — \, N — 2, . . .1 

C-min{?™,/^" + i?^}, (50) 

RZ ^ ^Zi^l^ ■ ■ ■ ^^rn) = j V^^-^{xi,X2,-.-,Xra+l)dn{Xra+l)- (51) 

T/ie lower hound in \4-9^ is attained if and only if for any m = r, . . . , iV — 1 
^{ln^<fp+R,r^} ^ V'm < ^ {I < + R,,.} fJ-'^-dlmost anywkcrc on Cf^. (52) 

In particular, conditions I152\) with m — 1,2, ... , N — 1 are necessary and sufficient 
for heing ip = {ipi, . . . , tp^^i, 1, . . . ) an optimal truncated rule in . The minimum 
value of L{tp), over tp G A"'^, is equal to 



1 + J V^^dfiixi) + 



Remark 5. Despite that any ip — (t/^i, . . . , ipN-i, 1, . . . ) satisfying ([52]) for m = 
1, . . . , — 1 is optimal among all truncated tests in A^, it only makes practical 
sense if 

/o > 1 + < 
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where Iq defined as 



Iq = mm Ai, . 

l<7<fc ^ 

l<i<k,i^j 



The reason is that Iq can be considered as "the L{iJ;)" function for a trivial 
sequential test (t/'Oi'^o) which, without taking any observations, makes a decision 
according to any ipo = {(f)Q, ...,(/)§) such that 



^o<^{2:.^,A,,=io}, i<j<k. 



In this case, there are no observations {N{ipo) — 0) and it is easily seen that 

l<i,j<k,i^j 



Thus, the inequality 

lo<l + Ro 

means that the trivial test {ipQ, (po) is not worse than the best truncated test in A^. 

Because of that, we consider VfJ^ defined by ([50)1 for m — 0, where, by definition, 
/g = 1, as the minimum value of L{ip) in A^, in the case it is allowed not to take 
any observations. 

Remark 6. It is not difficult to see from the proof of Lemma 2, that the problem 
of the optimal testing when the cost of the experiment is defined as 

N{^)dT:{e), (53) 

with some measure tt (see Remark [5]), under suitable measurability conditions, can 
receive essentially the same treatment. The corresponding optimal stopping rule in 
A^ will be defined by 

for r ~ 1,2, . . . , N — 1, with V/^ defined recursively as 

K^^i = mm{lr-i,J fg-'dnie) + J V,"" dpi{xr)}, (55) 

starting from r = iV, in which case = In- 

In the Bayesian context of Remark [2] the optimality of - ([55)1 with Xij ~ 
TTiLij, where Lij are some non- negative losses, i ^ j, can be derived also from 
Theorem 5.2.2 [5]. Our Theorem 2] gives, additionally to that, a necessary condition 
of optimality, providing the structure of all Bayesian truncated tests. Essentially, 
they are randomizations of (|54p : 



forr = 1,2,...,7V-1. 
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In purely Bayesian context, such conditions may be irrelevant, because any Bayesian 
test gives the same (minimum) value of the Bayesian risk. Nevertheless, for our (con- 
ditional) Problems I and II, it may be important to have a broader class of optimal 
tests, for easier compliance with ([9]) in Theorem [T] (or with (jlSp in Theorem [2|), just 
like the randomization of decision rule is important for finding tests with a given 
a-level in the Neyman-Pearson problem (see, for example, [7]). 



5 GENERAL STOPPING RULES 

In this section we characterize the structure of general stopping rules minimizing 
L(^). 

Let us define for any stopping rule ip 

N-l „ 

LnW = E / (1 - ^i) • • • (1 - ^Jn^i)Mnfe + ln)dfi" 



+ y (1 - ^i) ... (1 - Viv-i) {Nf^ + In) dix"". (56) 

(cf. (|42)) ). This is the Lagrange-multiplier function for ^jj truncated at iV, i.e. the 
rule with the components = (V'l, '02, ■ ■ • , i^N-i, 1, ■ • ■ )j Lfq{'ip) = L(V'^). 

Because if)^ is truncated, the results of the preceding section apply, in particular, 
the inequalities of Theorem [H 

The idea of what follows is to make ^ oo, to obtain some lower bounds for 
L(V') from gSD - (gg]). 

To be able to do this, we need some "approximation properties" for L{'ip), to 
guarantee that LmH^) L{^), as — » cxj, at least for stopping rules -0 for which 
Pe{T^ < oo) = 1. 

Lemma 4. Suppose that ^ is a stopping rule such that Pg{T^ < oo) = 1. 
(i) If L{ip) < oo and 

(1 - -0i) ... (1 - '4)n-i)lndn'^ ^0, as n ^ oo, (57) 



then 



hm LnW = L(V'). 



(ii) If L{ip) — oo then Lf4{ip) oo. 

Proof. Let L{%p) < oo. Let us calculate the difference between L{ip) and Ln{iI^) in 
order to show that it goes to zero as ^ oo. By (|56p 



n=l 
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X] y"(l - V-i) • • • (1 - i^n-i)Mnfe + IrOdfi'^ 
(1 - V'l) ... (1 - i'N-i) {Nfg'' + In) dfi"" 

^ y (1 - V^l) . . . (1 - Vn-OV-nKe" + ln)dfi'' 



(1 - Vi) ... (1 - ^AT-i) (iV/e^ + /iv) d/i"^. (58) 

The first summand converges to zero, as iV ^ oo, being tlie tail of a convergent 
series (this is because L^ip) < oo). 
We have further 

'(1 - ^i) • . . (1 - V'JV-O^ivdA*'^ ^ 



as iV ^ oo, because of ([57|) . 
It remains to show that 



(1 - V'l) ■ . . (1 - i^N-i)Nf^dfi^ = NPe{T^ > iV) ^ as -> oo. (59) 
But this is again due to the fact that L{ip) < oo which impUes that 

oc 

EeTt(, = ^ nPg{T^ =n) < oo. 



n=l 



Because this series is convergent, Y^^=n^Ps{t^ — n) Q. Thus, using the Cheby- 
shev inequahty we have 

oo 

NPe{T^, >N)< EeT^,I{r^>N} - nPg{T^, = n) ^ 



as iV ^ oo, which completes the proof of 
Let now L{'ip) = oo. 
This means that 



oo „ 

J2 y (1 - ^i) ... (1 - i^n-i)M''ife + ^«)rfM" = oo 



n=l 

which immediately implies by (|56p that 



n=l 



LnW >J2 / (1 - ^i) ... (1 - ^^n-i)Mnfe + ^n)dA*" 



□ 
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Lemma 21 gives place to the following definition. 

Let us say that our testing problem is truncatable if (|57p holds for any ip with 

EgT^ < OO. 

From Lemma m it immediately follows 
Corollary 1. For any truncatable problem 

Ln^tP) L{ip), as N ^ oo, 
for any stopping rule tp such that Pe{T^ < oo) = 1. 



Remark 7. It is obvious from ([57)1 that a testing problem is truncatable, in partic- 
ular, if 

J Ind^i" ^0, as n ^ oo. (60) 

Let us denote by aij{n,(j)) the error probability of a test corresponding to a fixed 
number n of observations, when the decision rule (j) is applied. From Theorem [3] it 
follows that the left-hand side of (|60p is the minimum weighted error sum: 

y ;„d^" = inf ^ Xijaij{n,(l}), 

where the infimum is taken over all decision rules 4>. 

Thus, (I60p requires a very natural behaviour of a statistical testing problem, 
namely that the minimum weighted error sum, over all fixed-sample size tests, tend 
to zero, as the sample size n tends to infinity. 

Remark 8. Any Bayesian problem (with N{ip) — '^'^^i N {9i; ^)7ri in ((T]), where 
TT,; > 0, i = 1, . . . , fc) is truncatable. Indeed, if N{ip) < oo then Eg.r^ = N{9i, tp) < 
oo for all i = 1, . . . , fc. Because of this, 

k 

y (1 - Vl) . . . (1 - i^n-l)lndt^" < y (1 - ^l) ... (1 - ^n-lKj2 ^^^foj^f^" 
k 

< ^^AiiPg. (r^ >n)^0, as n oo, 

i=2 

thus, (153 is fulfilled. 



i=2 



Our main results below will refer to truncatable testing problems. 
To go on with the plan of passing to the limit, as ^ oo, in the inequalities of 
Theorem m let us turn now to the behaviour of Vj^ , as iV ^ oo. 

Lemma 5. For any r > 1 and for any N > r 

> Vf+\ (61) 
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Proof. By induction over r = A^, iV — 1,...,1. 
Let r^N. Then by ^ 

V^+' = mm{lM, + J V^+,'dti{xM+i)} <In = . 
If we suppose that (pT|) is satisfied for some r, N > r > 1, then 
K^i = min{Z,_i,/r' + / K''d^lixr)} 



Thus, ((6T|) is satisfied for r — 1 as well, which completes the induction. □ 



It follows from Lemma [S] that for any fixed r > 1 the sequence Vj^ is non- 
increasing. So, there exists 

Vr = lim 1/^. (62) 



Now, everything is prepared for passing to the limit, as — > oo, in (j48|l and (|49|) 
with = V'^- If Pe{Tj(, < oo) = 1, then the left-hand side of (|48|l by Lemma [4] tends 
to L{ijj), whereas passing to the limit in the other two parts under the integral sign 
is justified by the Lebesgue monotone convergence theorem, in view of Lemma [5] 
For the same reason, passing to the limit as — * oo is possible in ([50]) (see (jSTjl ). 

In this way, for a truncatable testing problem we get the following 

Theorem 5. Let be any stopping rule. Then for any r > 1 the following inequal- 
ities hold 



LW > ^ / (1 - Vi) • • • (1 - iPn-i)Mnf^ + ln)dtx^ 

n=l 

+ 1(1 - V-i) . • . (1 - i'r) ((r + 1)/^' + ^^r+l) d^/+' (63) 

r— 1 

> ^ /(I - ^l) . . . (1 - lPn-l)i^n{nf 'g + ln)dfi^ 
n=l •' 

+ y (1 - ^i) . . . (1 - i^r-i) {rfl + Vr) d^Ji\ (64) 
K„ = min{/„„ + i?,„} (65) 

Rm= Rm{xi,. ■ . ,Xm) ^ Vjn+l{xi, . . . , Xm+l}dfi{Xjn+l) (66) 



where 
with 



for any m > 1 . 

In particular, the following lower bound holds true: 



L{^) > 1 + y" VMxi) = 1 + i?o. (67) 
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In comparison to Theorem 31 Theorem [S] is lacking a very essential element: the 
structure of the test achieving the lower bound on the right-hand side of (j67p . In 
case this test exists, by virtue of (p7)) it has to be optimal. 

First of all, let us show that if the optimal test exists, it reaches the lower bound 
on the right-hand side of ([57)1 . More exactly, we prove 

Lemma 6. For any truncatable testing problem 

infL(V') = l + i?o- (68) 

Proof. Let us denote 

f/ = infL(V'), Un = 1 + Ro, 

where Rq is defined in Theorem ID 
By Theorem m for any iV = 1, 2, . . . 

Un = inf L{iP). 

Obviously, Un > U for any N — 1,2, ... , so 

lim Un > U. (69) 

Let us show first that in fact there is an equality in ([59)) . 

Suppose the contrary, i.e. that limjv^ooC^Af — U + Ac, with some e > 0. We 
immediately have from this that 

UN>U + 3e (70) 

for all sufficiently large N. 

On the other hand, by the definition of U there exists a ip such that U < L{ip) < 
U + e. 

Because, by LemmalU LNii^) L{tp), as N ^ oo, we have that 

LN{i^)<U + 2e (71) 

for all sufficiently large N as well. Because, by definition, Ln{iP) > C/jv, we have 
that 

Un <U + 2e 



for all sufficiently large TV, which contradicts fTO)) . 
Thus, 

lim Un = U. 

Now, to get (1551) we note that, by the Lebesgue's monotone convergence theorem, 
U= lim Un = 1+ lim / {x)dn{x) = 1 + / Vi{x)dn{x) = 1 + Ra, 
thus, U = 1 + Ro. □ 
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Remark 9. For the Bayesian context (see Remark [5]), Lerama [5] can be derived 
from Theorem 5.2.3 [5^ if is supposed (see also Section 7.2 of [3] or Section 9.4 
of [U). 

The following theorem gives the structure of the optimal stopping rule for a 
truncatablc testing problem. 



Theorem 6. 



L(^) = infL(V''), (72) 



if and only if 

I{im<fip+R,„} < "^ni < I{u<f^^+B.„,} fi"" -almost anywhere on Cf^ (73) 
for all m — 1,2.... 

Proof. Let ip be any stopping rule. By Theorem O for any fixed r > 1 the following 
inequalities hold: 



i(V') > ^ / (1 - V^i) . . . (1 - i:n-i)Mnf 'e + ^n)rfM" 

n=l 

+ y (1 - ^i) . . . (1 - ^r) ((r + 1)/;+^ + K+l) dM(^r+l) (74) 

r-1 

^ y (1 - Vi) . . . (1 - + ;„)dAi" 



r-1 

> 



+ y (1 - Vi) . . . (1 - V-r-i) (r/^' + V;) dy.{xr) (75) 
> . . . 

> 



V'i(/e' + h)d^^ + y ~ + Mx2) (76) 

> 1 + y Vidfi{xi) = 1 + i?o. (77) 

Let us suppose that L{ip) = 1 + i?o- Then, by Lemma[6l there are equalities in all 
the inequalities ([74|) - ((77l) . Applying the "only if -part of Lemma [2] and using ((65l) 
and (pS)) . successively, starting from the last inequality ([77)l . we get that (|75|l has to 
be satisfied for any m = 1,2,.... The first part of the Theorem is proved. 

Let now tp be any test satisfying (|75|) . 

Applying the "if'-part of Lemma[2]and using ([55)1 and again, we see that all 
the inequalities in (|75|) - ((77| are in fact equalities for 



-0'' = (-0i,V2,---,V'r,l,---)- 

In particular, this means that there exists 



lim [V /(I - 0i) ... (1 - A^-i)Mr^fe + ^n)^^" 
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+ y (1 - Vi) ■ • ■ (1 - A) {{r + 1)/;+' + Vr+i) dnixr+i)] = l + Ro. (78) 

It follows from ([TH]) that 

limsup^^^ /(I - '0i) ... (1 - i^r)ir + 1) fg^'^ d^i{xr+i) 
= limsup^^^(r + l)Pe{T^ > r + 1) < 1 + i?o, 

which implies that limr^oo PeiT^p > r + 1) = 0. Thus, Peir^ < oo) = 1. 
From ([75]). it follows as well that 

lim V /[(I - ^i) ... (1 - i^„_i)V„(7i/e" + ;„)d/i" < 1 + i?o. (79) 

r — ^oo ^ — ^ / 

But the left-hand side of ([79]) is i(V') (because Pe{T^ < oo) = 1) and hence 

L{^)<l + Ro. (80) 

On the other hand, by virtue of Theorem [5l 

L{yj) > l + i?o, 

which proves, together with ([SO]) , that L{tp) = 1 + Rq. □ 

Remark 10. Once again (see Remark[5]), the optimal stopping rule tp from Theorem 
[6] only makes practical sense if Zq > 1 + ^^o. 

Remark 11. From the results of this section it is not difficult to see that the same 
method works as well for minimizing 

N{'tp)dTT{e) 

(see Remark E]). 

Repeating the steps which led us to Theorem |6] we get that the corresponding 
optimal stopping rule has the form 

ijr = U f f r = 1,2,3,..., (81) 

with 

Vr = lim V^^, 
being defined forr = iV— l,iV — 2,...,1 recursively by 



starting from = In- 



Optimal Sequential Multiple Hypothesis Tests 



21 



In a particular case of Remark [5] and 

/'7V(0;^)d7r(0) =^^,7V(0,;^) 

being Xij = LijiTi, this gives an optimal stopping rule for the Bayesian problem 
considered in [Ij. 

In particular, for fc = 2, this gives an optimal stopping rule for the Bayesian 
problem considered in [3]. 

6 APPLICATIONS TO THE CONDITIONAL PROB- 
LEMS 

In this section, we apply the results obtained in the preceding sections to minimizing 
the average sample size N^ip) — Eqt^ over all sequential testing procedures with 
error probabilities not exceeding some prescribed levels (see Problems I and II in 
Section 1). Recall that we are supposing that our problems are truncatable (see 
Section 5). 

Combining Theorems [U [3] and [51 we immediately have the following solution to 
Problem I. 

Theorem 7. Let ip satisfy ^7S\ ) for allm — 1,2,..., with any Xij > 0, i,j = 1, . . . ,k, 

i ^ j, (recall that Rm and l„i in \7S^ are functions of Xij), and let 4> be any decision 
rule satisfying \20\) . 

Then for all sequential testing procedures (^', 4>') such that 

aij {ip', <j>') < Uij {tp, (p) for all i,j = l,...,k,i^ j, (82) 

it holds 

N{i>') > N{iP). (83) 

The inequality in i83\} is strict if at least one of the inequalities in i8^) is strict. 
If there are equalities in all of the inequalities in i8S^) and i8S\) . then ip' satisfies 
^73\ l for all m ^ 1,2, .. . as well (with ^/ instead of tp). 

Proof. The only thing to be proved is the last assertion. 
Let us suppose that 

a^j{ip',(p') = aij{tp,<p), for all i, j = 1, . . . , k, i ^ j, 

and 

N{iP') = N{iP). 

Then, obviously, 

L(^', (/.') = Li^, cP) = Li^) > L{^') (84) 

(see (O and Remark [3]). 

By Theoremini there can not be strict inequality in ([M)) . so L{ip) = L{ip'). From 
Theorem [5] it follows now that ■(/;' satisfies (|75)) as well. □ 
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Analogously, combining Theorems [U [3] and [SI we also have the following solution 
to Problem II. 

Theorem 8. Let ip satisfy for all m = 1,2,..., with Xij = \i for all j ~ 
1, . . . , /c, where Xi > 0, i — 1, . . . k are any numbers, and let (j) he any decision rule 
such that 

for all j = 1, . . . , k and for all n — 1,2, ... . 

Then for any sequential test (ip' , (j)') such that 

|3^i^',<|>')<P^{'^P,4>) for any i^l,...,k, (85) 

it holds 

N{^') > N{i;). (86) 

The inequality in i86]) is strict if at least one of the inequalities in i85\) is strict. 
If there are equalities in all of the inequalities in i85]) and i86]) . then tJj' satisfies 
( [75'p for all m = 1,2, .. . as well (with ip' instead of tp). 

Remark 12. There are examples of applications of Theorem [7] (or [5]), in the case 
of two simple hypotheses based on independent observations, in ^9j. 

A numerical example related to the modified Kiefer- Weiss problem for indepen- 
dent and identically distributed observations can be found in [5]. Obviously, our 
Theorem [7] provides, for this particular case, randomized versions of the optimal 
sequential test studied in [8] (see also [9]). 
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